
Initialization-Dependent Sample Complexity of Linear Predictors and Neural Networks

Neural Information Processing Systems

Clearly, in order for learning to be possible, we must impose some constraints on the size of the function class. One possibility is to bound the number of parameters (i.e., the dimensions of the matrix W), in which case learnability follows from standard VC-dimension or covering number arguments (see Anthony and Bartlett [1999]).
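As a minimal illustration of how a parameter-count bound yields a sample-complexity guarantee, the sketch below computes a standard-form VC-dimension bound of the shape $m = O((d + \log(1/\delta))/\epsilon^2)$. The function name and the constant `c` are illustrative choices, not taken from the paper.

```python
import math

def vc_sample_complexity(vc_dim, eps, delta, c=8.0):
    """Samples sufficient for error <= eps with probability >= 1 - delta,
    for a hypothesis class of VC dimension vc_dim (up to the constant c).
    Standard-form bound: m = c/eps^2 * (vc_dim + ln(1/delta))."""
    return math.ceil(c / eps**2 * (vc_dim + math.log(1.0 / delta)))

# Doubling the parameter count (hence, for linear predictors, the VC
# dimension) only increases the bound linearly in vc_dim:
m_small = vc_sample_complexity(vc_dim=50, eps=0.05, delta=0.01)
m_large = vc_sample_complexity(vc_dim=100, eps=0.05, delta=0.01)
print(m_small, m_large)
```

Note how the bound is dominated by the $1/\epsilon^2$ factor; the confidence parameter $\delta$ enters only logarithmically.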







SABER: Uncovering Vulnerabilities in Safety Alignment via Cross-Layer Residual Connection

Joshi, Maithili, Nandi, Palash, Chakraborty, Tanmoy

arXiv.org Artificial Intelligence

Large Language Models (LLMs) with safety-alignment training are powerful instruments with robust language comprehension capabilities. These models typically undergo meticulous alignment procedures involving human feedback to ensure that they accept safe inputs while rejecting harmful or unsafe ones. However, despite their massive scale and alignment efforts, LLMs remain vulnerable to jailbreak attacks, where malicious users manipulate the model to produce harmful outputs that it was explicitly trained to avoid. In this study, we find that the safety mechanisms in LLMs are predominantly embedded in the middle-to-late layers. Building on this insight, we introduce a novel white-box jailbreak method, SABER (Safety Alignment Bypass via Extra Residuals), which connects two intermediate layers $s$ and $e$, with $s < e$, through a residual connection. Our approach achieves a 51% improvement over the best-performing baseline on the HarmBench test set. Furthermore, SABER induces only a marginal shift in perplexity when evaluated on the HarmBench validation set. The source code is publicly available at https://github.com/PalGitts/SABER.
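The core mechanism, an extra residual connection carrying layer $s$'s hidden state into layer $e$'s input, can be sketched on a toy stack of layers. This is a minimal NumPy illustration of the cross-layer-residual idea only, not the paper's implementation: the layers here are random linear maps standing in for transformer blocks, and the function names are invented for this sketch.

```python
import numpy as np

def forward(layers, x, extra_residual=None):
    """Run a toy layer stack; if extra_residual=(s, e), add layer s's
    output to layer e's input, mimicking SABER's cross-layer residual."""
    hidden = {}          # cache each layer's output by index
    h = x
    for i, W in enumerate(layers):
        if extra_residual is not None and i == extra_residual[1]:
            h = h + hidden[extra_residual[0]]  # inject layer s's state
        h = W @ h + h    # toy "block": linear map plus the usual residual
        hidden[i] = h
    return h

rng = np.random.default_rng(0)
layers = [rng.standard_normal((4, 4)) * 0.1 for _ in range(6)]
x = rng.standard_normal(4)

baseline = forward(layers, x)
patched = forward(layers, x, extra_residual=(1, 4))  # connect s=1 -> e=4
```

In a real white-box setting the same rewiring would be applied to an actual transformer's residual stream (e.g. via forward hooks), with $s$ chosen before and $e$ within the middle-to-late layers where the study locates the safety mechanisms.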